FLUSH: A Flexible Lexicon Design

نویسندگان

  • David J. Besemer
  • Paul S. Jacobs
چکیده

Approaches to natural language processing that use a phrasal lexicon have the advantage of easily handling linguistic constructions that might otherwise be extragrammatical. However, current phrasal lexicons are often too rigid: their phrasal entries fail to cover the more flexible constructions. FLUSH, for Flexible Lexicon Utilizing Specialized and Hierarchical knowledge, is a knowledge-based lexicon design that allows broad phrasal coverage. I. I n t r o d u c t i o n Natural language processing systems must use a broad range of lexical knowledge to account for the syntactic use and meaning of words and constructs. The problem of understanding is compounded by the fact that language is full of nonproductive constructs--expressions whose meaning is not fully determined by examining their parts. To handle these constructs, some systems use a phrasal lexicon [Becket, 1975, Wilensky and Arena, 1980b, Jacobs, .1985b, Steinacker and Buchberger, 1983, Dyer and Zernik, 1986], a dictionary designed to make the representation of these specialized constructs easier. The problem that phrasal lexicons have is that they are too rigid: the phrasal knowledge is entered in a way that makes it difficult to represent the many forms some expressions may take without treating each form as a distinct "phrase". For example, expressions such as "send a message", "give a hug", "working directory", and "pick up" may be handled as specialized phrases, but this overlooks similar expressions such as "give a message", "get a kiss", "working area", and "take up". Specialized constructs must be recognized, but much of their meaning as well as their flexible linguistic behavior may come from a more general level. A solution to this problem of rigidity is to have a hierarchy of linguistic constructions, with the most specialized phrases grouped in categories with other phrases that behave similarly. The idea of a linguistic hierarchy is not novel, having roots in both linguistics [Lockwood, 1972, Halliday, 1978] and Artificial Intelligence [Sondheimer et al., 1984]. Incorporating phrasal knowledge into such a hierarchy was suggested in some AI work [Wilensky and Arena, 1980a], but the actual implementation of a hier186 archical phrasal lexicon requires substantial extensions to the phrasal representation of such work. The Flexible Lexicon Utilizing Specific and Hierarchical knowledge (FLUSH) is one component in a suite of natural language processing tools being developed at the GE Research and Development Center to facilitate rapid assimilation of natural language processing technology to a wide variety of domains. FLUSH has characteristics of both traditional and phrasal lexicons, and the phrasal portion is partitioned into four classes of phrasal entries:

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Implementation of a Software System for Detecting Orthographical or Morphological Errors in Persian Words

This paper presents a new method for analyzing words in the Persian language context to find orthographical and structural errors regardless of the meaning. This technique tokenizes each word in a statement then tries to detect the kind of word, and analyses its correctness in terms of orthography and morphology by means of a lexicon. It should be noted that some words in the Persian language h...

متن کامل

بررسی رفتار و طراحی اتصال خمشی با ورق انتهایی هم تراز به روش اجزاء محدود تحت بارگذاری متناوب

The use of steel flush end-plate moment connection is practiced in the construction of the light steel frames around the world particularly in parts of Europe and US. In the past most research was concentrated on studying the behavior of the flush end-plate connection subjected to only monotonic type loadings. The majority of that research carried out such investigation through experimental pro...

متن کامل

Enhanced Flush+Reload Attack on AES

In cloud computing, multiple users can share the same physical machine that can potentially leak secret information, in particular when the memory de-duplication is enabled. Flush+Reload attack is a cache-based attack that makes use of resource sharing. T-table implementation of AES is commonly used in the crypto libraries like OpenSSL. Several Flush+Reload attacks on T-table implementat...

متن کامل

OPTIMIZATION OF MULTIPLE PANEL FllTlNG IN AUTOMOBILE ASSEMBLY

A systematic approach is presented to obtain improved panel tit quality through the use of an optimum panel fitting strategy. ‘The objective of the optimal1 panel fitting strategy is to determine the location of the panels on the automobile body such that the gap and flush variation of the panel fit are mmimized. This approach uses measurement data from both the panels and the body-inwhite (BIW...

متن کامل

Implications of a kinematic wave model for first flush treatment design.

A deterministic model was developed to predict pollutant mass first flush and to utilize it for better design of best management practices (BMPs) that focus on treating the first flush. The model used the kinematic wave equation to calculate flow and mass transport, and erosion equations to calculate pollutant concentrations, which were assumed to be from a short and a long term source. The mod...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1987